(57) Abstract

Cocoa flavour precursor peptides comprising 2-11 amino acid residues, in particular the nonapeptide Ala-Pro-Leu-Ser-Pro-Gly-Asp-
Val-Phe, are isolated and characterized from West African cocoa beans. A DNA sequence comprising the code of the peptides is synthesized,
and this is inserted into replicable vectors. A recombinant host cell transformed with an expression vector containing one or more copies
of the DNA sequence operably connected with control sequences which are recognized by the host cell, is cultivated to form the peptides,
and these are isolated from the cultivation mixture. A cocoa flavour is produced by mixing one or more of the peptides with predominantly
reducing saccharides and amino acids and roasting the mixture. The cocoa flavour may be added to food products, cosmetic products or
pharmaceutical products or may be formed in situ in these.
Cocoa flavour precursor peptides, DNA encoding them,
processes for producing the peptides, and their use for
generating cocoa flavour

This invention concerns peptides which are cocoa flavour
precursors, DNA encoding these peptides, vectors containing
the DNA, host cells transformed therewith, and processes
for producing the peptides as well as their use for
generating cocoa flavour. The peptides are isolated and
characterized from West African cocoa beans obtained from
the cocoa tree (Theobroma cacao).

BACKGROUND OF THE INVENTION 

Cocoa beans are seeds in cocoa pods which, after harvesting,
are freed from the pods and subjected to a fermentation
process at or near the cultivation site, following
which the greater part is exported for industrial processing.
Fermented cocoa beans are roasted, giving rise to
the characteristic chocolate or cocoa flavour. The subsequent
grinding produces cocoa mass which is included as a
main component in the chocolate production. Frequently,
part of the cocoa mass is pressed, resulting in cocoa
butter and cocoa powder, respectively.

The fermentation process generates heat, ethanol and in
particular acetic acid, and the microorganisms as such
participate only indirectly in the process. The heat activates
e.g. protein, oligosaccharide and polysaccharide
cleaving endogenous enzymes, which are again inactivated
in the last part of the fermentation process by relatively
large amounts of acetic acid. Acetic acid diffuses
into the fermented beans and, in addition to direct influence
on the degradation pattern and the rate, also exerts
an indirect influence. The latter effect consists in
500 different types of molecules have been detected in
water vapour distillate from roasted cocoa beans.

Cocoa flavour mainly consists of volatile components, but
the sensory experience is a combination of taste and
smell sensation. Mainly two groups of chemical compounds
contributing to the flavour sensation are formed during
roasting. These are aldehydes, which are formed by oxidative
deamination of amino acids, and pyrazines formed as
Maillard reaction products.

Nor have attempts at replacing the starting material been 
very successful. 

It has been attempted to produce coffee substitutes from
roasted grain or roasted chicory roots. General Foods
Corporation has taken out one of the earliest patents on
the production of artificial chocolate flavour by roasting
various mixtures of peptides, amino acids and carbohydrates
(US Patent No. 2 845 592, issued on May 20,
1958). The patent used a wide range of vegetable and animal
hydrolysates, and both chemical and enzymatic hydrolysis.
The degree of protein hydrolysis, the concentration
ratio of reactants and the roasting temperature are examined.
Preferred parameters are disclosed, and there are many
examples of the production of cocoa flavour substitutes
and their use either alone or in combination with other
substances. The patent represents one of the earliest
literature references for cocoa flavour substitutes from
other raw materials, and is drafted in very broad terms.
An example of corresponding, but more recent patents in
which it has been attempted to use protein hydrolysates
for producing cocoa flavour, is DDR Patent No. 205 815,
published on January 11, 1984, which preferably concerns
enzymatically produced protein hydrolysates of gelatine
and wheat gluten.
However, none of the processes referred to has been commercially
successful, the reason presumably being that
the flavour quality is not sufficiently good.

It is described in the International Patent Applications 
No. WO 91/19800 and No. WO 91/19801 from MARS UK Ltd., 
both published on December 26, 1991, how proteins corre- 
sponding to molecular sizes of 47 kD, 31 kD and 21 kD, 
respectively, were isolated from ether and acetone ex- 
tracted powders of ground ripe cocoa beans. These are 
presumed to be subunits of the storage proteins of the 
cocoa bean. The polynucleotide sequences were identified, 
N-terminal amino acid sequences were determined, and a 
range of polyclonal specific antibodies for polypeptide 
identification was produced. A 67 kD precursor of said 47 
and 31 kD proteins was identified and characterized. 

Correspondingly, a 23 kD precursor of the 21 kD protein 
was identified and characterized. DNA encoding said 21 
kD, 23 kD, 47 kD and 67 kD proteins was cloned in yeast. 

The patent claims of the applications claim protection 
for the mentioned proteins and for fragments thereof 
which might conceivably be of importance for the flavour 
generation. Protection is also claimed for nucleic acids 
encoding these proteins and fragments, for their incorpo- 
ration in vectors and for host cells containing these. 
However, it is remarkable that there is no documentation 
whatsoever as to which fragments might be of importance 
for the flavour formation, or as to how such fragments 
are to be produced. 



SUMMARY OF THE INVENTION

The following examples describe how a peptide having an
almost optimum flavour potential has been identified in
West African cocoa pods, following which the amino acid
sequence was determined. An oligonucleotide was synthesized,
encoding a fusion sequence between the peptide and
the coding sequence of a blood factor Xa cleavage site,
and the oligonucleotide was ligated into the vector pGEX-1
(Smith, D.B. & Johnson, K.S. (1988) Gene 67, 31-40),
which contains a gene encoding glutathione-S-transferase,
in extension of this gene. E. coli TG1 (Amersham) was
transformed with the vector, and the fusion protein was
expressed (Sikorski, R.S. & Hieter, P. (1989) Genetics
122, 19-27). The fusion protein was isolated by means of
a glutathione "Agarose"® affinity column. Fusion protein
so isolated and containing the blood factor Xa cleavage site
was cleaved with factor Xa and applied to the affinity
column once more, whereby glutathione-S-transferase was
retained on the column, and the peptide of interest was
eluted. The eluate was gel-filtered on a "Superdex® 75"
column (Pharmacia) by means of Pharmacia FPLC equipment.
The fraction containing the peptide was rechromatographed
on a reverse phase column, following which the identity of
the peptide was confirmed by means of mass spectrometry
and amino acid sequence determination by Edman degradation.

Accordingly, the invention provides a cocoa flavour precursor
peptide selected from an isolated peptide with the
amino acid sequence:

Lys-Ala-Pro-Leu-Ser-Pro-Gly-Asp-Val-Phe-Val

and fragments thereof containing 2-10 amino acid residues.
In particular, the peptide of the invention is selected
from fragments of the peptide with the above-mentioned
sequence containing 2-9 amino acid residues calculated
from the alanine residue No. 2, and it is preferably a
nonapeptide with the amino acid sequence:

Ala-Pro-Leu-Ser-Pro-Gly-Asp-Val-Phe.

The invention also comprises a DNA isolate comprising a
DNA sequence encoding a peptide as stated above, which
isolate, however, does not include the coding sequences
of the 67 kD, 47 kD and 31 kD cocoa proteins.



The DNA isolate of the invention comprises in particular
the DNA sequence:

5'-AAR-GCN-CCN-NNN-NNN-CCN-GGN-GAY-GTN-TTY-GTN-3'

or parts thereof of at least two codons in reading frame
from the 5'-terminus, and preferably the DNA sequence:

5'-GCN-CCN-NNN-NNN-CCN-GGN-GAY-GTN-TTY-3'

or parts thereof of at least 2-8 codons from the 5'-terminus
in reading frame therefrom, the two NNN codons in each
sequence encoding Leu and Ser, respectively.
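
As an illustrative check that is not part of the original disclosure, the following Python sketch tests whether a concrete coding sequence is covered by a degenerate sequence written in IUPAC nucleotide codes. The degenerate nonapeptide sequence is taken in the form given for SEQ ID NO: 3 in the sequence listing, and the concrete sequence is the nonapeptide-coding part of the synthetic oligonucleotide described in this specification. The function name `matches_degenerate` and the variable names are assumptions made for illustration only.

```python
# IUPAC nucleotide codes needed here: R = A or G, Y = C or T, N = any base.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "N": "ACGT",
}


def matches_degenerate(concrete: str, degenerate: str) -> bool:
    """Return True if every base of `concrete` is allowed by `degenerate`."""
    if len(concrete) != len(degenerate):
        return False
    return all(base in IUPAC[code] for base, code in zip(concrete, degenerate))


# Degenerate nonapeptide coding sequence, codon by codon (form of SEQ ID NO: 3).
DEGENERATE_NONAPEPTIDE = "".join(
    ["GCN", "CCN", "NNN", "NNN", "CCN", "GGN", "GAY", "GTN", "TTY"]
)

# Nonapeptide-coding codons of the synthetic oligonucleotide.
OLIGO_NONAPEPTIDE = "".join(
    ["GCC", "CCA", "TTG", "TCA", "CCT", "GGT", "GAC", "GTC", "TTT"]
)

if __name__ == "__main__":
    print(matches_degenerate(OLIGO_NONAPEPTIDE, DEGENERATE_NONAPEPTIDE))
```

Note that NNN as written in the sequence listing accepts any codon, so this check alone does not confirm that positions three and four encode Leu and Ser; that constraint is carried by the transl_except annotations.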

A particularly useful DNA isolate of the invention, which
comprises the coding sequences of a blood coagulation
factor Xa cleavage site and of the above-mentioned nonapeptide
as well as various restriction sites, is useful
for ligation in vectors which contain a gene encoding a
larger protein, so that these express a fusion protein
which is easier to purify, and from which the nonapeptide
can easily be released by factor Xa. This DNA isolate has
the DNA sequence:

5'-GATCTTGGATCC-ATCGAGGGTCGTGCCCCATTGTCACCTGGTGACGTCTTTTAG-3'

3'-    AACCTAGG-TAGCTCCCAGCACGGGGTAACAGTGGACCACTGCAGAAAATCTTAA-5'



The invention moreover comprises vectors which contain
the sequence of one of the above-mentioned DNA isolates,
and in particular expression vectors which contain one or
more copies of such a sequence operably linked to control
sequences which are recognized by a host cell transformed
with the vector.

Recombinant host cells transformed with these vectors
are also comprised by the invention. Such host cells may
be prokaryotes, e.g. Escherichia coli, or eukaryotes,
e.g. yeast, mycelial fungi or cell lines of multicellular
organisms. Yeast, which is a well-known microorganism
widely used in the food industry, must be considered particularly
useful for producing cocoa flavour precursor
peptides of the invention.

The invention moreover comprises various processes for
producing the peptides of the invention.

Firstly, there is the process which was first used for
forming and isolating the peptides from their natural
sources, comprising freeing ground cocoa beans of lipids
by extraction with an organic solvent and washing with
acetone and an aqueous acidic buffer solution and then
incubating the ground cocoa beans with an aqueous acidic
buffer solution for autolysis of the proteins, following
which the mass is extracted with methanol, and the extract
is applied to a strong cation exchange column, from
which the peptide fraction is eluted with a strong base
and rapidly neutralized, and the desired peptides are
isolated by chromatography.

Secondly, there is the process best suited for industrial
production of the peptides, viz. cultivation of a culture of a
recombinant host cell, as stated above, and isolation of
the resulting peptide from the cultivation mixture.

Finally, the peptides of the invention may also be produced
by chemical synthesis from the individual amino acids.

The use of the peptides of the invention for producing
cocoa flavour is also comprised by the invention.

Moreover, the invention comprises cocoa flavour produced
by mixing of one or more peptides of the invention with
predominantly reducing saccharides and amino acids and
subsequent heat treatment of the mixture for 1-60 min at
100-200 °C, preferably for 5-15 min at 110-150 °C. In
such a cocoa flavour, the quantitative proportion between
peptide(s), saccharides and amino acids is usually

peptide(s)     30-90% by weight
saccharides    10-40% by weight
amino acids     0-30% by weight

and preferably

peptide(s)     50-80% by weight
saccharides    15-35% by weight
amino acids     5-15% by weight

based on the total amount of these ingredients. The saccharides
in the mixture may practically consist of fructose
or glucose or mixtures thereof, preferably a mixture
of fructose and glucose in a weight ratio from 3:1 to
1:3.
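
The stated proportions can be checked mechanically. The following Python sketch, offered only as an illustration under the assumption that the percentages refer to the combined weight of the three ingredient groups, classifies a mixture as falling within the preferred ranges, the usual ranges, or neither. The names `RANGES` and `classify_mixture` are hypothetical and not taken from the specification.

```python
# Weight-percent ranges from the text: (usual range, preferred range).
RANGES = {
    "peptides": ((30.0, 90.0), (50.0, 80.0)),
    "saccharides": ((10.0, 40.0), (15.0, 35.0)),
    "amino_acids": ((0.0, 30.0), (5.0, 15.0)),
}


def classify_mixture(weights: dict) -> str:
    """Classify a mixture as 'preferred', 'usual' or 'outside'.

    `weights` maps each ingredient group to its weight in any consistent
    unit; percentages are computed on the total of the three groups.
    """
    total = sum(weights[name] for name in RANGES)
    if total <= 0:
        raise ValueError("total weight must be positive")
    percent = {name: 100.0 * weights[name] / total for name in RANGES}

    def within(index: int) -> bool:
        # index 0 selects the usual range, index 1 the preferred range
        return all(
            RANGES[name][index][0] <= percent[name] <= RANGES[name][index][1]
            for name in RANGES
        )

    if within(1):
        return "preferred"
    if within(0):
        return "usual"
    return "outside"


if __name__ == "__main__":
    print(classify_mixture({"peptides": 65, "saccharides": 25, "amino_acids": 10}))
```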



The invention additionally comprises food products, cosmetic
products and pharmaceutical products which have
added thereto or contain a cocoa flavour, as stated
above. Advantageously, the food products may be chocolate,
confectionery, pastry or soft drinks. A particular
embodiment of such products has been achieved in that
during production they have been mixed with one or more
peptides of the invention and, if necessary, predominantly
reducing saccharides and amino acids and then subjected
to a heat treatment for 1-60 min at 100-200 °C,
preferably for 5-15 min at 110-150 °C.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a typical reverse phase chromatogram of an
extract of defatted and autolyzed cocoa beans with 70%
aqueous methanol. The nonapeptide Ala-Pro-Leu-Ser-Pro-
Gly-Asp-Val-Phe having a particularly high cocoa flavour
potential is isolated from the peak of the chromatogram
which is marked by an arrow.

Figure 2 shows a reverse phase chromatogram of said nonapeptide
produced by a chemical synthesis.

Figure 3 shows a reverse phase chromatogram of a peptide
material isolated from an E. coli strain which has been
transformed with a plasmid containing the code of said
nonapeptide sequence. The nonapeptide is detected in the
peak of the chromatogram which is marked by an arrow.

Chromatography conditions of the three reverse phase
chromatograms:

COLUMN: PEP-RPC HR 16/10 (Pharmacia).

MOBILE PHASE: Acetonitrile gradient, 0-20% acetonitrile
over 30 min followed by 20-100% acetonitrile
over 10 min in 0.1% trifluoroacetic acid
(TFA).

FLOW RATE OF MOBILE PHASE: 7 ml/min.

DETECTION: UV at 214 and 280 nm.
The acetonitrile gradient is plotted in the figures. The
gradient is plotted so that 0% acetonitrile is found at
the base line of 214 nm detection. The base line of 280
nm detection is raised with respect to the base line of
214 nm detection on the chromatogram.
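
The gradient described above can be written as a simple piecewise function. The following Python sketch is only an illustration and assumes that both gradient segments are linear, which the text does not state explicitly; the function names are hypothetical.

```python
FLOW_RATE_ML_PER_MIN = 7.0  # flow rate of the mobile phase given in the text


def acetonitrile_percent(t_min: float) -> float:
    """Acetonitrile content (%) at time t_min, assuming linear segments:
    0-20% over the first 30 min, then 20-100% over the following 10 min."""
    if t_min < 0 or t_min > 40:
        raise ValueError("gradient is defined for 0 <= t <= 40 min")
    if t_min <= 30:
        return 20.0 * t_min / 30.0
    return 20.0 + 80.0 * (t_min - 30.0) / 10.0


def eluted_volume_ml(t_min: float) -> float:
    """Volume of mobile phase pumped after t_min minutes at constant flow."""
    return FLOW_RATE_ML_PER_MIN * t_min


if __name__ == "__main__":
    for t in (0, 15, 30, 35, 40):
        print(t, acetonitrile_percent(t), eluted_volume_ml(t))
```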

DETAILED DESCRIPTION OF THE INVENTION 

EXAMPLE 1 

Processing of peptides

Ripe, fresh cocoa pods from the Gold Coast, Ivory Coast
were used for processing as described in this example.
The material was a hybrid, widely distributed in the
region, between two traditional cocoa tree types, Criollo
and Amelonado, representing the most important African
Forastero type.

After purification and disinfection in ethanol, cocoa
pods were divided into two halves by a sterile scalpel.
The pulp was removed, and the beans were frozen in liquid
nitrogen before drying in a freeze drier.

Immediately before extraction of lipids, dried pulp residues
and shell parts were removed, and the beans were
crushed in a mill having a tight screen. Beans thus
ground were mixed with petroleum ether and extracted in a
Soxhlet device. Then the mass was filtered and the residue
washed with cold acetone. Further washing was performed
on an ice bath with 70% acetone admixed with 0.15%
thioglycolic acid until no more colour was released. To
remove the residual water, washing was completed with
pure acetone, and the remaining so-called acetone residue
contained proteins and protein-like compounds.
The acetone residue was then washed with cold 0.05 M citrate
buffer admixed with 0.15% thioglycolic acid and 10
mM EDTA at pH 4.0. The washing procedure was repeated
with a large excess of buffer. The acid washed acetone
residue thus produced and containing the greater part of
the proteins and protein-related compounds (including endogenous
enzymes) occurring in the beans was incubated
with stirring at 50 °C in 0.2 M citrate buffer admixed
with 0.5% thioglycolic acid at pH 4.0.

After 24 hours the incubation was interrupted and the hydrolysate
admixed with cold methanol to a final concentration
of 70% by volume and an extraction volume of
about 20 times the weight of defatted beans. Cold extraction
(0-4 °C) was effected for 1/2 hour. Then extraction
was effected once more with 70% by volume methanol, and
the extracts were pooled and filtered.
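
The extraction volumes implied by these figures can be estimated as follows. This Python sketch is only an illustration and rests on two assumptions not stated in the text: that "20 times the weight" means 20 ml per gram of defatted beans, and that methanol and the aqueous hydrolysate mix with additive volumes (in practice methanol-water mixtures contract slightly). The function name `extraction_volumes` is hypothetical.

```python
def extraction_volumes(defatted_beans_g: float,
                       methanol_fraction: float = 0.70,
                       volume_per_gram_ml: float = 20.0) -> dict:
    """Estimate total, methanol and aqueous volumes (ml) for one extraction.

    Assumes additive volumes and 20 ml extraction volume per gram of
    defatted beans; both are simplifying assumptions.
    """
    if defatted_beans_g <= 0:
        raise ValueError("bean weight must be positive")
    total = volume_per_gram_ml * defatted_beans_g
    methanol = methanol_fraction * total
    return {
        "total_ml": total,
        "methanol_ml": methanol,
        "aqueous_ml": total - methanol,
    }


if __name__ == "__main__":
    print(extraction_volumes(10.0))
```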

To reduce any coloration of the extract, which may be ascribed
to polyphenol oxidase activity in the plant tissue,
adsorption was effected to polyvinylpolypyrrolidone
(PVPP) at pH 2.5. Peptides and amino acids in the extract
were then bound to washed and equilibrated strong cation
exchanger ("Dowex® 50W"), about 2.5 ml of wet ion exchanger
per gram of defatted bean. Then washing was performed
in sequence with 20 and 80% 2-propanol followed by
water to remove residual alcohol. The peptide fraction
was liberated by basic elution (pH 10.7-11.0), and after
the elution the pH value was lowered as quickly as possible
to about 7 by addition of HCl. The fraction was desalted
by means of cation exchanger and analyzed by reverse
phase chromatography (RPC). For roasting purposes,
bound peptides were eluted with ammonium hydroxide (pH
12), which was subsequently removed as well as possible
either by placement in an incubator under vacuum at 40 °C
overnight or by freeze-drying.



WO 96/38472 PCT/DK96/00230



Chromatography 
Alanine           10
Arginine           7
Asparagine         3
Aspartic acid      3
Glutamine          3
Glutamic acid      3
Glycine            2
Histidine          2
Isoleucine         5
Leucine           16
Lysine             7
Methionine         1
Phenylalanine     14
Serine             5
Threonine          4
Tyrosine           8
Valine             7

Total            100

Variations in the amino acid and sugar composition from
what is stated did not necessarily change the character
of the flavour by the roasting.

Evaluation 

An in-house sensory panel was trained to reproducibly
evaluate the most essential positive as well as negative
flavour characters of thin layer roasted samples. The
standard used was an ethanol extract from fermented, non-roasted
cocoa beans as well as cocoa powder.

A result of initial studies was that the pH of the eluate
should be above 8 for a good flavour development to be
achieved at all, and the pH should preferably be in the
range of 8 to 10. It was held that the lower pH limit was
about 6, below which no good flavour development could be
obtained, even with eluates of [...] of flavour precursor
peptides.

[...] It could be demonstrated that [...] potential. It
was [...] found by [...] amino [...] consisted of nine
amino acids [...] sequence Ala-Pro-Leu-Ser-Pro-Gly-Asp-
Val-Phe.

[...] polyphenol oxidase [...] are pronounced in case of
[...] process periods with access of oxygen.

To illustrate results of a [...] two of which did not
contain peptide.



Fraction No. Cocoa flavour Off flavour sensation 
Fraction 38 was held to have a great flavour potential
and was found to contain the above-mentioned nonapeptide.



Molecular cloning



With a view to molecular cloning in E. coli of a nucleotide
sequence corresponding to the identified nonapeptide,
a relevant oligonucleotide having the following
structure was synthesized:

BglII site - BamHI site - code of Xa recognition sequence
- code of nonapeptide - stop codon - EcoRI site

as well as the complementary sequence.

The two strands were purified by polyacrylamide gel electrophoresis
(PAGE) on 6% acrylamide gels containing urea
(Sambrook, J., Fritsch, E.F., and Maniatis, T., (1989)
in: "Molecular Cloning - A Laboratory Manual", 11.23-11.28,
2nd ed., Cold Spring Harbor Lab. Press). Purified oligonucleotide
strands were annealed to form the following
double strand:

5'-GATCTTGGATCC-ATCGAGGGTCGTGCCCCATTGTCACCTGGTGACGTCTTTTAG-3'

3'-    AACCTAGG-TAGCTCCCAGCACGGGGTAACAGTGGACCACTGCAGAAAATCTTAA-5'



Seen from the 5'-end, the first five nucleotides constitute
the greater part of the BglII restriction site,
which is AGATCT. The subsequent T is inserted to provide
a correct reading frame. The two subsequent triplets, GGA
and TCC, encode Gly and Ser, respectively, and constitute
a BamHI restriction site which is inserted as a marker
with a view to optional later PCR reaction. The next four
triplets, ATC-GAG-GGT-CGT, encode Ile-Glu-Gly-Arg which
is the recognition sequence of blood coagulation factor
Xa, a very specific proteolytic enzyme which cleaves on
the carboxyl side of arginine, so that the nonapeptide
starts with the correct N-terminus, alanine. The triplets
No. 4 and No. 3 from the 3'-end of this synthetic oligonucleotide,
which encode Asp-Val, are selected from the
genetic code so as to form a BsaHI restriction site,
GACGTC. The last triplet on this strand, TAG, is a stop
codon. The TTAA sequence at the 5'-end on the other
strand constitutes four of the six nucleotides of the
EcoRI restriction site.
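
The design features listed above can be verified with a short script. The following Python sketch is an illustration only: it translates the upper strand with the standard genetic code, locates the BamHI and BsaHI sites, and checks that the lower strand is complementary apart from the two single-stranded overhangs. The one-letter amino acid codes, the helper names `translate` and `complement`, and the 0-based positions are conventions chosen here, not taken from the specification.

```python
# Standard genetic code, built from the usual TCAG-ordered table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Upper strand, 5' to 3': BglII remnant + T, BamHI, factor Xa site,
# nonapeptide code, stop codon.
TOP = ("GATCTT" "GGATCC" "ATCGAGGGTCGT"
       "GCCCCATTGTCACCTGGTGACGTCTTT" "TAG")

# Lower strand as printed in the text, 3' to 5'.
BOTTOM_3_TO_5 = ("AACCTAGG" "TAGCTCCCAGCA"
                 "CGGGGTAACAGTGGACCACTGCAGAAA" "ATC" "TTAA")


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1 into one-letter amino acid codes."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def complement(dna: str) -> str:
    """Base-by-base complement (no reversal)."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))


PROTEIN = translate(TOP)
# Factor Xa cleaves after Ile-Glu-Gly-Arg (IEGR); what follows is the peptide.
RELEASED_PEPTIDE = PROTEIN.split("IEGR", 1)[1].rstrip("*")

if __name__ == "__main__":
    print(PROTEIN)
    print(RELEASED_PEPTIDE)
    print(TOP.find("GGATCC"), TOP.find("GACGTC"))
```

The released peptide in one-letter code corresponds to Ala-Pro-Leu-Ser-Pro-Gly-Asp-Val-Phe.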
EcoRI and BglII restriction was used for ligation into
the pGEX-1 vector (Smith, D.B. & Johnson, K.S. (1988)
Gene 67, 31-40), which contains a gene encoding glutathione-
S-transferase localized so that expression in transformed
bacteria causes synthesis of a fusion product between
this protein and the nonapeptide, which inter alia
facilitates purification and control of the expression
product.

The ligated plasmid was used for transformation of E.
coli strain TG1 supplied by Amersham (Hanahan, D. (1983),
J. Mol. Biol. 166, 557-580). The above-mentioned BsaHI
restriction site was introduced into the synthetic oligonucleotide
to enable control of recombinant plasmid
preparations by restriction mapping (Sambrook, J. et al.
(1989), Molecular Cloning). The correct recombinant plasmids
were sequenced by the dideoxy method (Sanger, F.,
Nicklen, S., and Coulson, A.R., (1977) Proc. Natl. Acad.
Sci. USA 74, 5463-5467; Sambrook, J. et al. (1989) Molecular
Cloning).

A selected strain of Escherichia coli containing the correct
recombinant plasmid has been deposited under the
conditions of the Budapest Treaty in Centraalbureau voor
Schimmelcultures, Oosterstraat 1, P.O. Box 273, NL-3740
AG Baarn, Holland, with the Accession Number CBS 552.94.

Transformed E. coli was cultivated in shaking bottles to
A600 of 0.7-1.0 at 28 °C, following which IPTG was added
to a concentration of 0.1 mM for induction of the tac
promoter. The cultures were cultivated for another 3-5
hours and then harvested.

Pelleted E. coli was resuspended in lysis buffer (50 mM
Tris HCl, pH 8.0, 0.2 mg/ml of lysozyme, 1 mM EDTA) about
1:1 (weight/vol.). The suspension was incubated for 5 minutes
at room temperature, and 0.04 (w/v) of 2% deoxycholate
and 100 units/ml of "Benzonase" were added. This
suspension was kept on ice for 30 minutes, and then cell
residues were centrifuged off.

The supernatant was applied to a glutathione "Agarose"®
affinity column equilibrated in 50 mM Tris HCl, pH 8.0 at
8 °C. The column was washed with a buffer, and the fusion
protein was eluted with a buffer admixed with 5 mM reduced
glutathione, dialyzed and analyzed by SDS polyacrylamide
gel electrophoresis on an 18% polyacrylamide
gel. The protein concentration was determined by means of
the Bradford method. The yield of fusion protein was determined
to be about 12 mg per g of E. coli cells (wet
weight).

Factor Xa cleavage (Nagai, K. & Thogersen, H.C. (1984)
Nature 309, 810-812) was performed as described by Knudsen
et al. (Knudsen, C.R., Clark, B.F.C., Degn, B., and
Wiborg, O., (1992) Biochem. Int. 28, 352-362) with a few
modifications. The weight ratio of protease to substrate
was constantly kept at 1:200. After cleavage, affinity
chromatography was again performed on the glutathione
"Agarose"® affinity column, and pure nonapeptide was
collected from the eluate.

Using laser mass spectrometry, the mass of the fusion
protein and of the glutathione-S-transferase part of the
cleaved fusion protein was determined to be 27 311 and
26 409, respectively, which, in view of the uncertainty
of the method, corresponds to a difference that might be
ascribed to the nonapeptide.
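
The mass argument can be made concrete with a small calculation. The following Python sketch is an illustration only; the average residue masses and the mass of water are standard reference values supplied here as assumptions, not figures from the specification. It compares the measured mass difference with the theoretical mass of the nonapeptide and estimates how much nonapeptide 12 mg of fusion protein could release at most.

```python
# Average residue masses (Da); standard reference values, not from the text.
RESIDUE_MASS = {
    "Ala": 71.0788, "Pro": 97.1167, "Leu": 113.1594, "Ser": 87.0782,
    "Gly": 57.0519, "Asp": 115.0886, "Val": 99.1326, "Phe": 147.1766,
}
WATER = 18.0153

NONAPEPTIDE = ["Ala", "Pro", "Leu", "Ser", "Pro", "Gly", "Asp", "Val", "Phe"]

# Masses reported in the text for the fusion protein and its GST part.
FUSION_MASS = 27311
GST_PART_MASS = 26409


def peptide_mass(residues: list) -> float:
    """Average mass of a free linear peptide: residue masses plus one water."""
    return sum(RESIDUE_MASS[r] for r in residues) + WATER


OBSERVED_DIFFERENCE = FUSION_MASS - GST_PART_MASS
THEORETICAL_MASS = peptide_mass(NONAPEPTIDE)

# Upper bound on nonapeptide (mg) released from 12 mg of fusion protein,
# assuming complete cleavage and recovery.
MAX_PEPTIDE_MG = 12.0 * THEORETICAL_MASS / FUSION_MASS

if __name__ == "__main__":
    print(OBSERVED_DIFFERENCE, round(THEORETICAL_MASS, 2), round(MAX_PEPTIDE_MG, 3))
```

Strictly, hydrolysing the Arg-Ala bond adds one water, so the expected difference between the two measured species is the sum of the residue masses (about 884) rather than the free peptide mass (about 902); both lie within the uncertainty the text attributes to the method.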

FPLC analyses showed that a gel filtration ("Superdex®
75") was necessary to remove various contaminants from
the nonapeptide. Then the same elution profile was revealed
under RPC (figure 3) as for nonapeptide isolated
from cocoa beans and for nonapeptide produced by chemical
synthesis. Plasma desorption mass spectrometry verified
the identity of the microbially synthesized peptide.

EXAMPLE 2 

The sequence of the identified nonapeptide may be found
as the amino acid residues Nos. 457 to 465 in the amino
acid sequence of 67 kD cocoa seed storage protein precursor
derived from the cDNA sequence (International Patent
Application No. WO 91/19801) and in the amino acid sequence
of cocoa seed vicilin derived from the gene sequence
(McHenry, L. & Fritz, P.J. (1992), Plant Mol.
Biol. 18, 1173-1176).

The nonapeptide isolated from cocoa beans is generated by
endogenous enzyme activity and thus represents naturally
produced peptides. The cleavage pattern reflects the endogenous
enzyme activities under the given physical circumstances,
and, of course, it is conceivable that
slightly changed conditions might give rise to new peptides
that might have a unique flavour potential. Therefore,
the present study comprised studying the flavour
potential of the nonapeptide extended by the next N-terminal
amino acid, lysine, and the next C-terminal amino
acid, valine, occurring in the cocoa storage protein. In
addition, a plurality of minor peptides was studied,
whose identities are set forth below. The peptides were
synthesized by chemical methods and then purified by HPLC
prior to tests in roasting experiments.
Nomenclature/peptide identity

Ala-2:  Ala-Pro
Ala-3:  Ala-Pro-Leu
Ala-6:  Ala-Pro-Leu-Ser-Pro-Gly
Ala-7:  Ala-Pro-Leu-Ser-Pro-Gly-Asp
Ala-8:  Ala-Pro-Leu-Ser-Pro-Gly-Asp-Val
Ala-9:  Ala-Pro-Leu-Ser-Pro-Gly-Asp-Val-Phe
Ala-10: Ala-Pro-Leu-Ser-Pro-Gly-Asp-Val-Phe-Val

Pro-7:  Pro-Leu-Ser-Pro-Gly-Asp-Val
Pro-8:  Pro-Leu-Ser-Pro-Gly-Asp-Val-Phe

Lys-10: Lys-Ala-Pro-Leu-Ser-Pro-Gly-Asp-Val-Phe
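
The naming scheme (N-terminal residue followed by chain length) can be reproduced from the hendecapeptide sequence. The following Python sketch is an illustration only; the function names `fragment` and `from_label` are hypothetical, and the sketch relies on the convention that "Pro" refers to the first proline in the sequence.

```python
HENDECAPEPTIDE = ["Lys", "Ala", "Pro", "Leu", "Ser", "Pro",
                  "Gly", "Asp", "Val", "Phe", "Val"]


def fragment(n_terminal: str, length: int) -> str:
    """Return the fragment of the hendecapeptide that starts at the first
    occurrence of `n_terminal` and contains `length` residues."""
    start = HENDECAPEPTIDE.index(n_terminal)
    residues = HENDECAPEPTIDE[start:start + length]
    if len(residues) != length:
        raise ValueError("fragment would extend beyond the hendecapeptide")
    return "-".join(residues)


def from_label(label: str) -> str:
    """Expand a label such as 'Ala-9' into the corresponding sequence."""
    name, length = label.split("-")
    return fragment(name, int(length))


TESTED = ["Ala-2", "Ala-3", "Ala-6", "Ala-7", "Ala-8", "Ala-9", "Ala-10",
          "Pro-7", "Pro-8", "Lys-10"]

if __name__ == "__main__":
    for label in TESTED:
        print(f"{label}: {from_label(label)}")
```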



Roasting - sensory evaluation

Portions of about 20 mg each were prepared as described
under "roasting" and roasted for 8 minutes at 130 [...]
evaluated shortly after the roasting [...] trained
individuals participated in the sensory [...] the
evaluations were [...] sessions were evaluated according
to a [...] scale discussed and [...]
SAMPLE    SCORE    SAMPLE    SCORE

Ala-2     3
Ala-3     4
Ala-6     3
Ala-7     1        Pro-7     2
Ala-8     4        Pro-8     3
Ala-9     6
Ala-10    1½       Lys-10    1



The nonapeptide was thus given a very positive evaluation
when alanine was N-terminal, and thus confirmed the observations
from the cocoa bean isolate. Most of the minor
peptides exhibited a not inconsiderable flavour potential,
and thus confirmed previous observations with fractionated
cocoa bean isolate. In all experiments, the nonapeptide
Ala-9 was evaluated as the clearly best one and
being unique.

When lysine was N-terminal (Lys-10), the flavour potential
was given a rather low evaluation. If this is compared
with the evaluation of Ala-10 as well as visual observations
during and after the roasting experiments, it
is strongly indicated that the solubility/miscibility becomes
problematic with this and greater chain lengths.

Off-flavours of a varying nature and intensity were
evaluated in many samples, apart from the very best ones.
It should be stressed in this connection that an unpleasant
pungent odour frequently occurs when proline is the
N-terminal amino acid. This may very well be ascribed to
the fact that proline contains imine as a functional
group, which may be of great importance to the Maillard
reaction procedure.

Further, it is worth noting that endogenous enzymes, which
are responsible for the formation of the nonapeptide during
incubation of cocoa beans, may have a great resemblance
to trypsin and chymotrypsin and/or pepsin, respectively.
Such specificities would be required in order to generate
the correct terminal amino acids, alanine and phenylalanine.
SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Aarhus Oliefabrik A/S

(B) STREET: P.O. Box 50 

(C) CITY: Aarhus C 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP): DK-8100 

(ii) TITLE OF INVENTION: Cocoa flavour precursor peptides, DNA
encoding them, processes for producing the peptides, and
their use for generating cocoa flavour

(iii) NUMBER OF SEQUENCES: 6 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Theobroma cacao 

(B) STRAIN: Forastero 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..33

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /codon_start= 1

/function= "Cocoa flavour precursor"

/product= "Peptide"

/evidence= EXPERIMENTAL

/transl_except= (pos: 10 .. 12, aa: Leu)

/transl_except= (pos: 13 .. 15, aa: Ser)

/note= "The hendecapeptide and fragments thereof
comprising 2-10 amino acid residues are useful
cocoa flavour precursors"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



AARGCNCCNN NNNNNCCNGG NGAYGTNTTY GTN
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Theobroma cacao 

(B) STRAIN: Forastero 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..11

(D) OTHER INFORMATION: /label= Hendecapeptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Lys Ala Pro Leu Ser Pro Gly Asp Val Phe Val 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Theobroma cacao 

(B) STRAIN: Forastero 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..27

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /function= "Cocoa flavour precursor" 

/product= "Peptide"
/evidence= EXPERIMENTAL
/transl_except= (pos: 7 .. 9, aa: Leu)
/transl_except= (pos: 10 .. 12, aa: Ser)
/note= "The nonapeptide is a potent cocoa flavour
precursor"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GCNCCNNNNN NNCCNGGNGA YGTNTTY 27



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Theobroma cacao 

(B) STRAIN: Forastero 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ala Pro Leu Ser Pro Gly Asp Val Phe 
1 5 
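
The relationship between the degenerate DNA sequences (SEQ ID NO: 1 and 3) and the peptides (SEQ ID NO: 2 and 4) can be checked mechanically. The following is a minimal sketch, not part of the original listing. It assumes the standard genetic code, the IUPAC ambiguity codes R, Y and N, and that the two fully degenerate NNN codons are the positions covered by the transl_except entries (Leu and Ser). The function names and the sequence strings, written here codon by codon from the feature tables, are illustrative assumptions.

```python
from itertools import product

# IUPAC nucleotide codes that occur in the degenerate sequences.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "R": "AG", "Y": "CT", "N": "ACGT"}

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

THREE_TO_ONE = {"Lys": "K", "Ala": "A", "Pro": "P", "Leu": "L", "Ser": "S",
                "Gly": "G", "Asp": "D", "Val": "V", "Phe": "F"}


def encoded_residues(degenerate_codon):
    """Set of one-letter residues encoded by every expansion of the codon."""
    choices = [IUPAC[base] for base in degenerate_codon]
    return {CODON_TABLE["".join(c)] for c in product(*choices)}


def encodes(dna, peptide, exceptions=()):
    """True if each degenerate codon encodes exactly the expected residue.

    Codon indices in `exceptions` (0-based) are skipped; they stand for the
    transl_except positions, where NNN is narrowed by annotation only.
    """
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = peptide.split()
    if len(codons) != len(residues):
        return False
    for index, (codon, residue) in enumerate(zip(codons, residues)):
        if index in exceptions:
            continue
        if encoded_residues(codon) != {THREE_TO_ONE[residue]}:
            return False
    return True


# Assumed reading of SEQ ID NO: 3 and SEQ ID NO: 1 (spaces removed).
SEQ3 = "GCNCCNNNNNNNCCNGGNGAYGTNTTY"
SEQ1 = "AARGCNCCNNNNNNNCCNGGNGAYGTNTTYGTN"
SEQ4 = "Ala Pro Leu Ser Pro Gly Asp Val Phe"
SEQ2 = "Lys Ala Pro Leu Ser Pro Gly Asp Val Phe Val"

print(encodes(SEQ3, SEQ4, exceptions={2, 3}))
print(encodes(SEQ1, SEQ2, exceptions={3, 4}))
```

Without the exceptions the check fails, since NNN alone does not determine a residue; this is why the listing needs the transl_except annotations.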



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Theobroma cacao 

(B) STRAIN: Forastero 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 1..5

(D) OTHER INFORMATION: /note= "Larger part of BglII
restriction site which is AGATCT"

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb

(B) LOCATION: 7..12

(D) OTHER INFORMATION: /note= "A BamHI restriction site, GGATCC"

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 13..51

(C) IDENTIFICATION METHOD: experimental
(D) OTHER INFORMATION: /product= "Fused peptide"
/evidence= EXPERIMENTAL

/note= "Fusion of recognition sequence Ile-Glu-Gly-Arg
for blood coagulation factor Xa cutting 3' of Arg and
cocoa flavour precursor nonapeptide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GATCTTGGAT CC ATC GAG GGT CGT GCC CCA TTG TCA CCT GGT GAC GTC 48

Ile Glu Gly Arg Ala Pro Leu Ser Pro Gly Asp Val
1 5 10

TTT TAG 54

Phe
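
The CDS feature of SEQ ID NO: 5 (positions 13..51) can be translated to confirm that it encodes the factor Xa recognition sequence fused to the nonapeptide, which is the 13-residue sequence given as SEQ ID NO: 6. The following is a minimal sketch, not part of the original listing. It assumes the standard genetic code and takes the 54-base string as read from this listing together with the upper strand in claim 7. The function and variable names are illustrative.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

# Three-letter names for the residues that occur in this construct.
ONE_TO_THREE = {"I": "Ile", "E": "Glu", "G": "Gly", "R": "Arg", "A": "Ala",
                "P": "Pro", "L": "Leu", "S": "Ser", "D": "Asp", "V": "Val",
                "F": "Phe"}

# Assumed reading of SEQ ID NO: 5: linker, coding region, last codon plus stop.
SEQ5 = ("GATCTTGGATCC"
        "ATCGAGGGTCGTGCCCCATTGTCACCTGGTGACGTC"
        "TTTTAG")


def translate(dna, start, end):
    """Translate the 1-based, inclusive interval [start, end] of `dna`."""
    cds = dna[start - 1:end]
    return [ONE_TO_THREE[CODON_TABLE[cds[i:i + 3]]]
            for i in range(0, len(cds), 3)]


fused = translate(SEQ5, 13, 51)
print("-".join(fused))

# Factor Xa cuts after the Arg of Ile-Glu-Gly-Arg, which would release
# the last nine residues of the fused peptide.
released = fused[4:]
print("-".join(released))

# The three bases after the CDS should be a stop codon.
print(CODON_TABLE[SEQ5[51:54]] == "*")
```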

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ile Glu Gly Arg Ala Pro Leu Ser Pro Gly Asp Val Phe
1 5 10

PATENT CLAIMS

1. A cocoa flavour precursor peptide selected from a 
peptide with the amino acid sequence (SEQ ID No. 2): 

Lys-Ala-Pro-Leu-Ser-Pro-Gly-Asp-Val-Phe-Val

and fragments thereof containing 2-10 amino acid residues.

2. A peptide according to claim 1 selected from fragments
of the peptide SEQ ID No. 2 containing 2-9 amino
acid residues calculated from the alanine residue No. 2
in SEQ ID No. 2.

3. A peptide according to claim 1 or 2 which is a nonapeptide
with the amino acid sequence (SEQ ID No. 4):

Ala-Pro-Leu-Ser-Pro-Gly-Asp-Val-Phe.

4. A DNA isolate comprising a DNA sequence encoding a
peptide according to any one of claims 1-3, which isolate,
however, does not include the coding sequences of
the 67 kD, 47 kD and 31 kD cocoa proteins.

5. A DNA isolate according to claim 4 which comprises 
the DNA sequence (SEQ ID No. 1): 

5'-AAR-GCN-CCN-NNN-NNN-CCN-GGN-GAY-GTN-TTY-GTN-3'

or parts thereof of at least two codons in reading frame
from the 5'-terminus.

6. A DNA isolate according to claim 4 or 5 which comprises
the DNA sequence (SEQ ID No. 3):

5'-GCN-CCN-NNN-NNN-CCN-GGN-GAY-GTN-TTY-3'

or parts thereof of 2-8 codons from the 5'-terminus in
reading frame therefrom.






7. A DNA isolate according to any one of claims 4-6
which has the DNA sequence (SEQ ID No. 5):

5'-GATCTTGGATCC-ATCGAGGGTCGTGCCCCATTGTCACCTGGTGACGTCTTTTAG-3'

3'-AACCTAGG-TAGCTCCCAGCACGGGGTAACAGTGGACCACTGCAGAAAATCTTAA-5'.
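
The two strands recited in claim 7 can be checked for complementarity. The following is a minimal sketch, not part of the claim. It assumes the strands are as read here (upper strand 5' to 3', lower strand written 3' to 5') and that each strand carries a four-base single-stranded overhang at its 5' end. The names are illustrative.

```python
# Complement table for the four DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Upper strand, written 5' -> 3' (hyphen removed).
TOP = "GATCTTGGATCCATCGAGGGTCGTGCCCCATTGTCACCTGGTGACGTCTTTTAG"
# Lower strand, written 3' -> 5' exactly as in the claim (hyphen removed).
BOTTOM_3_TO_5 = "AACCTAGGTAGCTCCCAGCACGGGGTAACAGTGGACCACTGCAGAAAATCTTAA"

OVERHANG = 4  # assumed length of each single-stranded 5' overhang


def paired_regions(top, bottom_3to5, overhang):
    """Return the stretches of each strand that face one another.

    The first `overhang` bases of the upper strand and the last `overhang`
    bases of the lower strand (as written 3' -> 5') are unpaired.
    """
    return top[overhang:], bottom_3to5[:len(bottom_3to5) - overhang]


top_paired, bottom_paired = paired_regions(TOP, BOTTOM_3_TO_5, OVERHANG)

# Because the lower strand is already written 3' -> 5', a plain complement
# (no reversal) of the upper strand should reproduce it.
print(top_paired.translate(COMPLEMENT) == bottom_paired)

# Overhangs, each read 5' -> 3'.
print(TOP[:OVERHANG], BOTTOM_3_TO_5[-OVERHANG:][::-1])
```

Under these assumptions the 50 paired bases are complementary throughout, and the upper-strand overhang GATC matches the BglII-compatible end described for SEQ ID NO: 5.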

8. A replicable vector containing the sequence of a DNA
isolate according to any one of claims 4-7. 

9. An expression vector containing one or more copies of 
a DNA isolate according to any one of claims 4-7 operably
connected with control sequences which are recognized by 
a host cell transformed with the vector. 

10. A recombinant host cell transformed with a vector
according to claim 8 or 9.

11. A recombinant host cell according to claim 10 which
is a yeast cell.

12. A recombinant host cell according to claim 10 which
is a cell of Escherichia coli.

13. A recombinant host cell according to claim 12 which
is E. coli CBS 552.94. 

14. A biologically pure culture of a recombinant host
cell according to any one of claims 10-13.

15. A process for producing a peptide according to any
one of claims 1-3 wherein ground cocoa beans are freed of
lipids by extraction with an organic solvent and washing
with acetone and an aqueous acidic buffer solution and
are then incubated with an aqueous acidic buffer solution
for autolysis of the proteins, following which the mass
is extracted with [illegible], and the extract is applied to
a strong cation exchange column, from which the peptide
fraction is eluted with an aqueous base and is rapidly
neutralized, and the desired peptides are isolated by
chromatography.

16. A process for producing a peptide according to any
one of claims 1-3 by cultivation of a culture according
to claim 14 and isolation of the resulting peptide from
the cultivation mixture.

17. A process for producing a peptide according to any
one of claims 1-3 by synthesis from the individual amino
acids.

18. Use of a peptide according to any one of claims 1-3
for generating cocoa flavour.

19. A cocoa flavour produced by mixing one or more peptides
according to any one of claims 1-3 with predominantly
reducing saccharides and amino acids and subsequent
heat treatment of the mixture for 1-60 min at 100-200 °C,
preferably for 5-15 min at 110-150 °C.

20. A cocoa flavour according to claim 19 wherein the
quantitative proportion between peptide(s), saccharides
and amino acids is

peptide(s) 30-90% by weight

saccharides 10-40% by weight

amino acids 0-30% by weight

based on the total amount of these ingredients.
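
The proportions in claim 20 are weight percentages of the three ingredients taken together, so a given mixture can be tested against them by normalising the masses to their sum. The following is a minimal sketch, not part of the claim. The function names and the example masses are hypothetical and serve only to illustrate the arithmetic.

```python
# Ranges from claim 20, in % by weight of the three ingredients combined.
CLAIM_20 = {
    "peptides": (30.0, 90.0),
    "saccharides": (10.0, 40.0),
    "amino_acids": (0.0, 30.0),
}


def weight_percentages(masses):
    """Convert ingredient masses (any common unit) to % of their total."""
    total = sum(masses.values())
    if total <= 0:
        raise ValueError("total mass must be positive")
    return {name: 100.0 * mass / total for name, mass in masses.items()}


def within_ranges(masses, ranges=CLAIM_20):
    """True if every ingredient's weight percentage lies inside its range."""
    percentages = weight_percentages(masses)
    return all(low <= percentages[name] <= high
               for name, (low, high) in ranges.items())


# Hypothetical mixtures, masses in grams.
mixture_a = {"peptides": 6.0, "saccharides": 3.0, "amino_acids": 1.0}
mixture_b = {"peptides": 2.0, "saccharides": 7.0, "amino_acids": 1.0}

print(weight_percentages(mixture_a), within_ranges(mixture_a))
print(weight_percentages(mixture_b), within_ranges(mixture_b))
```

Mixture A works out to 60/30/10% and lies inside all three ranges; mixture B works out to 20/70/10% and falls outside the peptide and saccharide ranges.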


21. A cocoa flavour according to claim 20 wherein the
quantitative proportion between peptide(s), saccharides
and amino acids is

peptide(s) 50-80% by weight

saccharides 15-35% by weight

amino acids 5-15% by weight

based on the total amount of these ingredients.

[...] fructose or glucose or mixtures thereof.

23. [...] a weight ratio of [...]

[...] which contains [...] according to any one of claims 19-23.

[...] according to any one of claims 19-[...]

[...] product which has [...] flavour [...] to any one of claims [...]

[...] claims 24 [...] to any one of [...] if necessary, predominantly
reducing saccharides and [...] which has then been subjected to heat treatment
[...] min at 100-300 °C, preferably for 5-15 min at 110-150